Can We Trust Test Outcomes?
نویسندگان
چکیده
Software development is an on-going process. To ensure that the changes made do not break previous functionality, a regression test suite is maintained along with the source code. The correctness and the side effects of code changes are judged based on the output of the tests. However, are these tests’ outcomes always correct? Does a test failure imply a bug and do all tests passing imply a bug free code? The answer is no. Experienced researchers and practitioners know that there is an inherent non-determinism in testing. This paper contributes the first comprehensive study on test outcome non-determinism. We target 153 open-source projects with a novel methodology starting from the commit logs, which enabled us to identify many unreported cases of tests with nondeterministic outcomes (also called flaky tests). We were able to detect 1129 commits from 51 projects that are related to flaky tests, out of which we study 486 commits which are about distinct flaky tests. 61% of these flaky tests do not have associated bug reports. In our study, we focus on 81 commits to analyze in depth. We identify 11 causes of flakiness, provide examples for each, and give recommendations on how to fix them. We also discuss other interesting findings such as the effectiveness of common fixes, the evolution of flaky tests, how hard it is to reproduce a flaky test, and how hard it is to patch it.
منابع مشابه
A Single Femoral Component for All Total Hip Replacements Performed by a Trust? Does This Affect Early Clinical and Radiological Outcomes?
Background: Hospitals may be under pressure to implement cost saving strategies regarding prosthesis choice. Thismay involve the use of components which are not the first preference of individual surgeons, or those they have littleexperience with. We aim to examine the effect of standardizing the type of femoral stem used in a single trust, anddetermine whether this is safe practice, particular...
متن کاملTrust in Buyer-Supplier Relationships: Supplier Competency, Interpersonal Relationships and Outcomes
In this paper, we develop a model that links the antecedents of trust, trust itself and outcome success. We test the model using data from a questionnaire and present the path analytical results. The data suggest that performance outcomes are based on both direct effects from objective supplier performance and indirect effects through trust. However, many of the “people oriented” trust enablers...
متن کاملA Model for E-banking Trust In Iran’s Banking Industry
One of the crucial factors affecting success of e-banking services is customer’s trust. Acquiring customer trust depends on different variables that an e-banking struggles to control. This research aims to explain and clarify the inter-relationships and the interaction effects among customer’s online trust and other relevant factors such as satisfaction and loyalty in e-banking. We developed a ...
متن کاملSuperior Capsule Reconstruction: What Do We Know?
The management of irreparable rotator cuff tears remains challenging. Since its introduction by Mihata in 2012, superiorcapsule reconstruction (SCR) has grown in popularity at an astonishingly rapid rate. The aim of this article is to providea comprehensive review of the available literature, in order to highlight what has so far been published on SCR,covering all aspects including biomechanica...
متن کاملSolving the Unconstrained Optimization Problems Using the Combination of Nonmonotone Trust Region Algorithm and Filter Technique
In this paper, we propose a new nonmonotone adaptive trust region method for solving unconstrained optimization problems that is equipped with the filter technique. In the proposed method, the various nonmonotone technique is used. Using this technique, the algorithm can advantage from nonmonotone properties and it can increase the rate of solving the problems. Also, the filter that is used in...
متن کاملThe Relationship between Modified Biophysical Profile, Standard Biophysical Profile, and Neonatal Outcomes of High-risk Pregnancies
Introduction: High-risk pregnancies can result in many complications for the fetus. In these pregnancies, different tests such as non-stress test (NST), biophysical profile (BPP), oxytocin contraction stress test (OCT), and Doppler sonography can be used to evaluate fetal health. As standard BBP requires more time and expertise, in this study, we evaluated the relationship between standard BPP,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014